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HUMAN SUBSTANCE P RECEPTOR 
Backgrou nd of the Invention 

" 5 

This invention relates to the human substance P 
receptor. More particularly, the invention concerns the 
molecular cloning and functional expression of the human 
substance P receptor and a novel stable cell line that 
10 expresses large numbers of the recombinant substance P 
receptor . 

Substance P (SP) is a peptide neurotransmitter 
and neuromodulator originally detected in 1931 based on 
its smooth muscle contractile activity (1) • In 1971 it 

15 was isolated based on its sialagogic activity, and its 
primary structure was established as Arg-Pro-Lys-Pro- 
Glu-Gln-Phe-Phe-Gly-Leu-Met-NH2 (2) [SEQIDNO:!]. SP 
has since been shown to participate in the regulation of 
diverse biological activities (3,4,5), and it is an 

20 excitatory agent released from central, peripheral and 
gastrointestinal neurons. In addition, SP regulates 
certain endocrine and exocrine gland secretions, it aids 
in the regulation of blood pressure by acting at both 
central and peripheral sites, and it has been suggested 

25 to be involved in the regulation of some immunological 
disorders and certain inflammatory states. It is now 
well established that the biological actions of SP are 
mediated largely via a receptor that interacts 
specifically with the conserved tachykinin ceurboxyl 

30 terminal domain. The specific amino terminal sequences 
of the mammalian tachykinin peptides dictate receptor 
affinity and selectivity. Ligand interaction with the 
SPR activates guanyl nucleotide binding protein 
dependent second messenger systems that mediate the 

35 specific biological response. Recently, Yokota et al. 
(6) and Hershey and Krause (7) molecular ly characterized 
and functionally expressed the rat SP receptor (SPR) , 
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and established it to be a member of the G-protein 
coupled receptor super family. 

References cited in parentheses herein are listed 
at the end of the specification. 

5 

Brief Description of the Invention 

In accordance with the present invention a cDNA 
encoding the novel human substance P receptor has been 

10 isolated and characterized, and nucleotide sequence 
analysis has been used to deduce the primary structure 
of the receptor protein. The human substance P receptor 
consists of 407 amino acid residues and is a member of 
the G-protein coupled receptor superfamily. Comparison 

15 of the novel humem and the prior art rat substance P 
receptor amino acid sequences demonstrated that they 
have a 94.5% identity that is largely evident in 
transmembrane domains, and in intracellular domains. 
In accordance with another aspect of the 

20 invention, the novel hiaman substance P receptor was 
transiently expressed from plasmid pH^hSPR in a COS-7 
cell line and showed a Kd for Tyr'^-substance P binding 
of 0.24 nM. A clonal cell line stably expressing the 
novel human substamce P receptor from plasmid pM^hSPR was 

25 created in a CHO cell background, said cell line being 
designated herein as CHO-pM^-hSPR #10. This cell line 
e^tpresses 500,000 substance P receptors per cell with an 
affinity of 0.29 nM. A cultiire of this cell line is on 
deposit xmder the Budapest Treaty with the American Type 

30 Culture Collection, Rockville, MD, under accession 
number ATCC CRL 10824. 

Although the invention is particularly 
illustrated by the use of CHO cells (Chinese hamster 
ovary) and COS-7 cells (monkey kidney, SV40 transformed) 

35 which contain the human substance P receptor directed 
from the expression of the plasmid pH^hSPR, it will be 
understood that other conventional cell lines, e.g., 
murine cells, HeLa cells, canine cells and the like, can 
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similarly be used as host cells for expressiqn of the 
substance P receptor. 

The pattern of ligand displacement by naturally 
occurring tachykinin peptide was substance P»neurokinin 
5 A>neurokinin B. Ligand stimulation of transfected cells 
results in a rapid and transient inositol 1,4,5- 
triphosphate response • RNA blot hybridization and 
solution hybridization demonstrated that the natxirally 
expressed human substance P receptor mRNA was about 4.5 

10 Kb in size, and was expressed in IM-9 lymphoblast and 
U373-HG astrocytoma cells, as well as in spinal cord and 
lung but not in liver. These results demonstrate that 
the human substance P receptor is expressed in many 
places and it mediates the many diverse functions of 

15 human substance P. 

Cell lines containing the human substance P 
receptor cDNA are useful for examining cellular 
mechanisms regulating human substance P receptor mHNA 
expression and for screening for antagonists of human 

20 substance P such as may be useful for central, 
peripheral and gastrointestinal system disorders, 
inflammation and immune disorders. Since tissues in the 
human body that express the substance P receptor express 
only about 5-10,000 receptors per cell, the 

25 substantially and significantly higher expression system 
of about 500,000 receptors per cell in accordance with 
the present Invention permits rapid and faster screening 
of candidate compounds acting at the substance P 
receptor. The human substance P receptor also is useful 

30 as a diagnostic approach for identifying abberrant 
receptor sequences in humem disease states. 

Detailed Description of the Invention 

35 While the specification concludes with claims 

particularly pointing out and distinctly claiming the 
subject matter regarded as forming the present 
invention, it is believed that the invention will be 
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better understood from the following detailed 
description of preferred embodiiaents of the invention in 
conjxinction with the appended drawings, in which 
briefly: . 

5 

FIG. 1 shows the nucleotide sequence and deduced 
amino acid sequence [SEQ ID NO: 3] of the human substance 
P receptor. Nucleotide numbering shown on the right 
side starts with +1 beginning with A of the initiator 
10 methionine codon. Amino acid sequence is numbered below 
the displayed sequence. The putative a-helical 
transmembrane domains labeled MI-MVII are underlined. 



FIG. 2 shows the expression of the human 
15 substance P receptor in COS-7 cells in two bar graph 
panels. A. Comparison of ^^l-Tyr'^-SP binding to 
nontransfected cells and to cells transfected with a 
plasmid encoding either the human SPR, the hiaman SPR in 
the antisense orientation or the rat SPR. B. 
20 Competition of '^I-Tyr"^-SP binding by naturally 
occurring and synthetic tachykinin peptides. 
Transf ection conditions and ligand binding were 
performed with 0.1 nif ^^I-Tyr'^-SP as described in 
Methods hereinbelow. Each datum represents the X ± SEH 
25 of four duplicate determinations performed with 
different prepsurations of transfected cells* 

FIG. 3 is a graphical representation which shows 
the displacement of '^I-Tyr'^-SP binding to transfected 

30 COS-7 cells by the naturally occurring tachykinins: 
substance P, neurokinin A and neurokinin B. 
Transfection conditions and ligand binding were 
performed with 0.1 nM ^^I-Tyr'^-SP as described in * 
Methods hereinbelow. Each datum point represents the X± 

35 of four determination's performed in duplicate. The SEM 
was less than 5% for all data presented. 
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FIG. 4 is a graphical representation which shows 
the saturation analysis of '^l-Tyr*'-SP binding to 
transfected COS-7 cells. Cells were transfected and 
ligand binding was performed as described in Methods 

5 hereinbelow. The data shown are from fotir 

determinations performed in duplicate on separate 
transfected cell preparations. The variation in ligand 
concentration for all data points was less than 3% of 
the mean concentration shown. The calculated Kd and 

0 values were o.24± and 15l,000± per cell, respectively. 

FIG. 5 is a graphical representation which shows 
a human substance P stimulated inositol trisphosphate 
response as a function of time after stimulation of 

5 transfected COS-7 cells. Cells were transfected, 
harvested and stimulated with 1 iM human substance P, 
and inositol trisphosphate levels were determined as 
described in Methods hereinbelow. The data shown 
represents the results from a single transfection and 

) stimulation test. Similar results were obtained in a 
repeat of the test. 



FIG, 6 shows the analysis of human substance P 
receptor mRNA expression patterns by RNA blot and 
25 solution hybridization methods. The upper left shows 
the RNA blot results, the upper right shows the solution 
hybridization-nuclease protection results, and the lower 
portion illustrates the probes used. For RNA blots, 2 
Hg poly (A)* RNA was denatured, electrophoresed on 1* gels 
and transferred to nitrocellulose as described in 
Methods hereinbelow. For solution hybridization, 25 fig 
total RNA was annealed with the coding region probe, and 
non-hybridized probe was digested with RNases A and T, 
as described in Methods hereinbelow. An autoradiogram 
35 of protected species after electrophoresis on a 

denaturing 6% polyacrylamide gel is shown. Standards 
for the RNA blot were 0.24 to 9.5 Kb RNA ladder (BRL, 



30 
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Gaithersburg, MD) , and stcindards for the nuclease 
protection gel were radiolabeled Mspl-digested pBR322« 

FIG, 7 shovs a comparison of the amino acid 
5 sequences of human substance P receptor [SEQ ID NO: 3] 
and rat substance P receptor [SEQ ID NO: 7]. Identical 
residues between the two sequences are indicated by the 
vertical line. Putative membrane spanning domains MI- 
M7II are over lined. The closed triangles indicate 
10 consensus N-linked glycosylation sites, the filled 
circles indicate potential intracellular serine and 
threonine phosphorylation sites, and the arrow depicts a 
potential palmitoylation site. 

15 Standard biochemical nomenclatxire is used herein 

in which the nucleotide bases are designated as adenine 
(A) ; thymine (T> ; guanine (G) ; and cytosine (C) . 
Corresponding nucleotides are, for example, 
deoxyadenosine-5" -triphosphate (dATP) . Amino acids are 

20 shown either by conventional three or one letter 
abbreviations as follows: 



wo 93/03137 



PCr/US92/06532 



Abbreviated Designation Amino Acid 



5 



10 



15 



20 



A 


Ala 


AXGimne 


c 


Cys 




n 


Asp 


AspfUrclc acid 


E 




GiurajBic acid 


F 




DY^AMT^I a1 aw 4** a 

x'lienyxaxanine 


ti 




Glycine 


IT 

n 


nlS 


Hlstidine 




lie 


Isoleuclne 




XtjrS 


Lysine 


T. 
iJ 


lieu 


Leucine 


H 


Met 


Methionine 


N 


Asn 


Asparagine 


P 


Pro 


Proline 


Q 


Gin 


Glutamine 


R 


Arg 


Arginine 


S 


Ser 


Serine 


T 


Thr 


Threonine 


V 


Val 


Valine 


W 


Trp 


Tryptophan 


Y 


Tyr 


Tyrosine 



25 

In order to illustrate specific preferred 
embodiments of the invention in further detail^ the 
following exemplary laboratory work was carried out. 
References to pxiblications cited herein in parenthesis 

30 are listed at the end of the specification- Although 
specific examples are illustrated herein, it will be 
appreciated that the invention is not limited to these 
specific examples or the details described therein. 
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Materials and Methods 

5 Materials > Most reagents used herein are 

conventional and have been described previously (7, S, 
9} • The plasmid pM^ was obtained from Dr. Irving Boime, 
Washington University School of Medicine (10) • 
Oligonucleotides for secpience analysis were obtained 

10 from the Washington University Protein Chemistry 

Facility. IM-9 ixomunoblast cells were obtained from 
Drs. Normsm Boyd and Sus2Ui Leeman^ University of 
Massachusetts Medical Center, and U373 MG astrocyte 
cells were obtained from the ATCC (ATCC HTB 17) . Tyr*^- 

15 Substance P (Tyr-Arg-Pro-Lys-Pro-Gln-Gln-Phe-Phe-Gly- 
Leu-Met-NHg) [SEQ ID NO: 4] was synthesized by the 
Washington University Protein Chemistry Facility and was 
pxirif led to homogeneity by HPLC using the general 
procedures previously described (11) . Radioiodination 

20 of peptide was performed using the conventional 

chloramine T oxidative iodination procedure and HPLC 
purification of the monoiodo form of Tyr'^-*SP. 

RNA isolation > cDNA and genomic cloning > PCR 
25 methods and nucleotide sequence analvsis . The methods 
for BNA isolation, (poly (A) ^ RNA selection and cDNA 
synthesis are conventional and have been described 
previously (7, 8, 9). PCR was performed using a Perkin 
Elmer thermal cycler jas previously described (7) with 
30 IM-9 cDKA as target. Initially a cDNA was generated by 
PCR using oligonucleotide primers corresponding to G* 
protein coupled receptor membrane spanning domains II 
and VII. . A 671 bp cDNA was isolated, subcloned into 
BLUESCRIPT (pBS) and sequenced; it contained an open 
35 reading frame with 90 • 5% identity to the corresponding 
rat substance P receptor cDNA (6,7) and gene (9) 
sequence. This plasmid was termed pBS-hSPRII-VII and 
the inserted cDKA corresponded to nucleotides 237*908 of 
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that shown in Figure 1. The 5' and 3' extents of the 
hSPR cDNA coding region as well as nontranslated 
sequences were determined by isolation and 
characterization of human SPR genomic exons i and 5, 
5 respectively,, using the pBS-hSPR II-VII cDNA insert and 
rat genomic exons (9) as probes, and is shown in Figure 
1. An amplified human X Dash II genomic library 
(Stratagene, La Jolla, CA) was screened and 10 positive 
phage were isolated and characterized by restriction 
10 mapping and hybridization analysis. Hybridizing 

sequences corresponding to exons 1-5 were Identified, 
and exons 1 and 5 were isolated as 1.2 kb EcoRl and 1.4 
kb EcoRl fragments, subcloned and sequenced. The 
predicted coding region of the human SPR was generated 
15 by PCR with IM-9 cDNA by using oligonucleotides 
corresponding to the coding region 5» 
(5'CCACCATGGATAAOGTCCTCCCGGTG 3') [SEQ ID NO: 5] and 
3» (antisense, 5 • CTAGGAG5AGCAGATTGGAG6AGAA3 • ) [SEQ ID 
NO: 6] ends as primers. The cDNA generated was isolated 
20 by agarose gel electrophoresis and was blunt-end ligated 
into Smal-digested pBS. Electroporation of bacterial 
cells with the ligated DNA yielded multiple isolates 
that were further analyzed by restriction mapping and by 
nucleotide sequence smalysis. One cDNA (corresponding 
25 to bases -5 to +1227 of that shown in Fig. i) was 
isolated after restriction with Hindlll and BamHI 
(present in the pBS polylinker) , and was made blunt- 
ended with Klenow fragment. The pM* was also blunt- 
ended with Klenow after BamHI digestion. The cDNA was 
ligated to pH^, and was used to transform E. coli XL-l 
Blue cells by electroporation. Colonies containing 
inserts were identified and the orientation of inserts 
was determined by restriction analysis. Two plasmids, 
called pM^-hSPR and pM^-hSPR antisense, were identified. 
35 Sequence analysis was performed as described previously 
(7, 8, 9). 



30 
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Transfection of COS-7 cells, llaand binding tests 
and inositol 1> 4, 5 trisphosphate assay > COS-7 cells 
plated at 50 to 90% confluence were transfected by 
conventional means as previously described (7) . Cells 
5 harvested 48-72 hours after transfections were incubated 
with ^^I-Tyr'^'-SP (for 2 hours at 4") and binding was 
determined by a conventional rapid filtration assay as 
previously described (11) . Typical binding tests were 
performed with approximately 150,000 transfected cells 

10 per assay tube. Competition binding was performed by 
adding the competitor prior to that of radiolabelled 
ligemd. Ligand binding data was analyzed by the LIGAND 
program (12). Cellular inositol 1,4^5-trisphosphate 
level was determined with a radioreceptor assay (13) 

15 with rat cerebellar membranes (14) using conventional 
extraction and assay conditions as previously described 
(15) . 

RNA blot and solution hybridizations . These were 
20 performed by conventional procedures as described 

previously (8, 9, 16, 17) . A random-primer labeled cDNA 
was prepared with Klenow fragment of DNA polymerase I 
for the pBS'-hSPRII-VXI cDNA insert, and an antisense RNA 
was prepared by transcription using T7 SNA polymerase and 
25 EcoRl linearized pBS-hSPRII-VII. RNA gels (1.0%) were 
blotted onto Nytran membranes, and the protected RNA 
species from solution hybridization tests were 
electrophoresed on 6% polyacrylamide gels containing 7M 
urea. Autoradiography was performed at -70 • with an 
30 intensifying screen. 

RESULTS 

A human SPR cDNA fragment corresponding to 
35 nucleotides +237 to +908 in Figture 1 was generated by 
PCR from CDNA prepared from IH*9 lymphoblast cell RNA 
using conventional procedures previously described (7) . 
The 5* end of the coding region was determined by 
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isolation and sequence analysis of the hiiman SPR gene 
exon 1, and the 3* end of the cDHA was determined by 
isolation and sequence analysis of the human SPR gene 
exon 5 as described in Methods hereinbefore. These 
5 sequences provided the 5* and 3' translated sequences of 
the human SPR, and a PCR using lM-9 cell cDNA was used 
to generate a full coding region containing cDNA. This 
cDNA was subcloned into the pM^ expression vector in 
which the cDNA is under the control of the Harvey murine 

10 sarcoma virus LTR (10) and was used for functional 
expression. 

COS-7 cells were transfected with pM^-hSPR, 
pM^hSPR antisense and pM^SPR, three plasmids that 
contain the human SPR cDNA, the human SPR cDNA inserted 

15 in the antisense orientation jand the rat SPR cDNA (7) , 
and 48 to 72 hours later the cells were examined for 
binding of ^^I-Tyr^^-SP x:iang a rapid filtration assay. 
Figure 2A shows these results in which the human and rat 
SPR construct transfected cells bind 15,000 to 25,000 

20 cpm ligand that is displaced by 1 iM SP. Nontransfected 
cells or cells transfected with pM^hSPR antisense showed 
no specific binding. Consequently, ligand displacement 
and saturation analyses were performed with the pM'hSPR 
construct to determine whether the binding site 

25 corresponded pharmacologically to that of the SPR or so- 
called NK-1 type tachykinin binding site. Figure 2B 
shows that at 10 nM SP or physalaemin, specific '^^I- 
Tyr-'^-SP binding was reduced by 85 to 95%, whereas 10- 
fold higher concentrations of tachykinins potent at NK- 

30 2 and NK-3 receptors, including neurokinin A, neurokinin 
B, neuropeptide 7, neuropeptide K, eledoisin and 
senktide, were much less potent in displacing 
radiolabelled ligand binding. Also, substance P free 
acid was much less potent in this regeurd, thereby 

35 demonstrating the importance of the substance P 

carboxyamide moiety in ligand binding. Additional tests 
were performed with SP, NKA and NKB at various doses to 
determine the IC50 values for displacing ^^I-Tyr'^-SP 
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binding, and these data are shown in Figure 3. SP was 
the most potent displacer of ligand binding compared to 
NKA and NKB^ with ICgp values of 0.72 ± 0.9 nM, 0.63 ± 
0.06 fM, and 1.12 ± 0.21 MM, respectively r each with 
5 Hill coefficients of 0.94 - 0.96. Therefore^ this 
cloned human cDNA encodes a sequence in transfected 
cells that upon ligand binding analysis has the 
characteristics of a SPR or NK-1 type tachykinin binding 
site. Saturation analysis of ^^I-Tyr*^-SP binding was 

10 performed to determine the affinity and relative number 
of binding sites expressed by pM^hSPR transfection of 
COS-7 cells. Figure 4 shows these results; Scatchard 
analysis of this data by the ligand program (12) 
provides a 1 site fit with a kj value of 0.24 ± 0.01 nM, 

15 with an average of 151,000 ± 8,000 sites expressed per 
cell. 

Transiently transfected COS-7 cells were 
stimulated with 1 /iH SP to determine cellular inositol- 
1,4,5 trisphosphate responses. Two tests were performed 

20 in which time points after stimulation of 5, 10, 15, 20, 
30, 60 and 120 seconds were analyzed. A transient 
response of 2.5 to 3-fold above resting levels was 
observed (Figure 5} at 10 to 15 seconds after 
stimulation with a retuam to basal level by 20 to 30 

25 seconds. 

Some patterns of humsui SPR RNA expression were 
also examined using RNA isolated from cell lines or 
tissues, and these data are. shown in Figure 6. By 
northern blot analysis, a single hybridizing species of 

30 approximately 4.5 kb was identified in poly (A)* RNA 
isolated from IM-9 cells but not in liver. Similar 
hybridizing species were observed with both an RNA 
coding region probe and a DNA probe corresponding to 
exon I of the SPR gene. The coding region antisense RNA 

35 probe was also used in a more sensitive solution 
hybridization-nuclease protection test in which the 
probe corresponds to a 712 base sequence which when 
annealed with hSPR mRNA, will protect a species of 671 
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bases. Figure 6 shows that this probe will protect a 
species of 671 bases in IM-9 and U373 cell RNA 
preparation and also in spinal cord and lung RNA 
preparations, that correspond to SPR nRNA; these are not 
observed in HepG2 cell or liver RNA preparations. In 
IM-9 cell RNA preparations, two additional protected 
species of approximately 150 bases and 350 bases were 
noted. These species have not been completely 
characterized, but they may correspond to exon 1 
protected species and exons 1 plus 2 protected species 
which would be present in partially spliced RNAs. These 
have been observed in rat tissues (9) and appear to 
correspond to slowly spliced nuclear SPR RNA precursors.' 
Figure 7 shows a comparison of primary structures 
15 of human and rat SPR protein, as deduced from cDNA 
cloning and sequence analysis. Twenty-two of the 407 
residues are different between the sequences; these 
differences are distributed throughout and are generally 
conservative. Both sequences encode receptors with 7 
20 putative o-helical transmembrane domains based on 

hydrophobicity plotting and by comparisons to other G- 
protein coupled receptors. The human SPR has 2 N-linked 
glycosylation sites in the amino terminal domain, and a 
potential palmitoylation site (cys-323) 15 residues 
25 carboxyl terminal to the MVII transmembrane domain. 

Multiple potential serine and threonine phosphorylation 
sites exist in the 3rd cytoplasmic and carboxyl terminal 
domain and many of these are conserved between the two 
sequences. The carbo;cyl terminal tail region is 
30 separated by an acidic region about half way into the 
sequence, which separates the two Ser/Thr rich regions. 

The foregoing results indicate that the novel 
human and previously characterized rat (6,7) SPR's show 
a sequence identity of 94.6% throughout the entire 
35 primary structure, sequences within all 7 putative o- 
helical transmembrane domains and within many regions of 
both extracellular and intracellular domains are 
conserved. These include two consensus N-linked 
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glycosylation sites in the amino terminal region, a 
conserved consensus palmitoylation site (Cys-323) 
carboxyl terminal to transmembrane domain VII, and 
multiple potential serine and threonine phosphorylation 
5 sites on intracellular domains, especially that of the 
third cytoplasmic domain and in the carboxyl terminal 
region. Two regions shoving divergence are scattered in 
the amino terminal region and are clustered near the 
carboxyl terminus within the carboxyl terminal domain. 
10 Based upon the high degree of sequence identity it is 
likely that sequences within the primary structure 
essential for G-protein coupling, high affinity agonist 
binding, and desensitization of receptor responses are 
conserved. 

15 

Various other examples will be apparent to the 
person skilled in the art after reading the present 
disclosiire without depeorting from the spirit and scope 
of the invention, and, it is intended that all such other 
20 examples be included within the scope of the appended 
claims. 
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SEQUSNCS LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANTS Krause^ James E. 
(ii) TITLE OF INVENTION: Human Substance P Receptor 
(ill) NUMBER OF SEQUENCES: 7 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Scott J. Meyer, Monsanto Co., A3SD 

(B) STREET: 800 N. Lindbergh Blvd. 
<C) CITY: St. Louis 

(D) STATE: Missouri 

(E) COUNTRY: U.S. A 

(F) ZIP: 63167 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Meyer, Scott J. 

(B) REGISTRATION NUMBER: 25,275 

(C) REFERENCE/DOCKET NUMBER: 07-24 (776) A 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (314)694-3117 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; XI amino acids 

(B) TlfPB: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TXPE: peptide 

(ix) FEATURE: 

(A) NAME/KEX: Modif ied-site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /Xabel=^ amide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Arg Pro Lys Pro Glu Gin Phe Phe Gly Leu Met 
15 10 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 211.. 1431 
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(xi) SEQUENCE DESCRIP^TION: SEQ ID N0:2: 

MTTCAGAGC CACC6C666C A6GCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCX5C 60 
CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGXCTGCTTT 120 
AGAAGGACCC TGAGCCCCAG GOGCGAGCCA CS^ACTCTG CTGCAGAGGG GGGTTGTGTA 180 

CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATG GAT AAC GTC CTC CCG GTG GAC 234 

Het Asp Asn Val Leu Pro Val Asp 
1 5 

TCA GAC CTC TCC CCA AAC ATC TCC ACT AAC ACC TCG GAA CCC AAT CAG 282 
Ser Asp Leu Ser Pro Asn He Ser Thr Asn Thr Ser Glu Pro Asn Gin 
10 15 . 20 

TTC GTG CAA CCA GCC TGG CAA ATT GTC CTT T6G GCA GCT 6CC TAG ACG 330 
Phe Val Gin Pro Ala Trp Gin He Val Leu Trp Ala Ala Ala Tyr Thr 
25 30 35 40 

GTC ATT GTG 6T0 ACC TCT GTG GTG GGC AAC GTG GTA GTG ATG TGG ATC 378 
Val He Val Val Thr Ser Val Val Gly Asn Val Val Val Met Trp He 
45 50 55 

ATC TTA GCC CAC AAA AGA ATG AGG ACA GTG ACG AAC TAT TTT CTG GTG 426 
He Leu Ala His Lys Arg Met Arg Thr Val Thr Asn Tyr Phe Leu Val 
60 65 70 

AAC CTG GCC TTC GCG GAG GCC TCC ATG GCT GCA TTC AAT ACA GTG GTG 474 
Asn Leu Ala Phe Ala Glu Ala Ser Met Ala Ala Phe Asn Thr Val Val 
75 80 85 

AAC TTC ACC TAT GCT GTC CAC AAC GAA TGG TAC TAC GGC CTG TTC TAC 522 
Asn Phe Thr Tyr Ala Val His Asn Glu Trp Tyr Tyr Gly Leu Phe Tyr 
90 95 100 

TGC AAG TTC CAC AAC TTC TTT CCC ATC GCC GCT GTC TTC GCC ACT ATC 570 
Cys Lys Phe His Asn Phe Phe Pro He Ala Ala Val Phe Ala Ser He 
110 115 120 
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TAG TCC ATG ACS GOT GTG GCC TTT GAT AGG TAC ATG GCC ATC ATA CAT 618 
Tyr Ser Met Thr Ala Val Ala Phe Asp Arg Tyr Met Ala He He His 
125 130 135 

CCC CTC CA6 GCC CGG CTG TCA GCC ACA GCC ACC AAA GTG GTC ATC TGT 666 
Pro Leu Gin Pro Arg Leu Ser Ala Thr Ala Thr Lys Val Val He Cys 
140 145 150 

GTC ATC TGG GTC CTG GCT CTC CTG CTG GCC TTC CCC CAG GGC TAC TAC 714 
•Val He Trp Val Leu Ala Leu Leu Leu Ala Phe Pro Gin Gly Tyr Tyr 
155 160 165 

TCA ACC ACA GAG ACC ATG CCC AGC AGA GTC GTG TGC ATG ATC GAA TGG 762 
Ser Thr Thr Glu Thr Met Pro Ser Arg Val Val Cys Met He Glu Trp 
170 175 180 

CCA GAG CAT CCG AAC AAG ATT ^AT GAG AAA GTG TAC CAC ATC TGT GTG 810 
Pro Glu His Pro Asn Lys He Tyr Glu Lys Val Tyr His He Cys Val 
185 190 195 200 

ACT GTG CTG ATC TAC TTC CTC CCC CTG CTG GTG ATT GGC TAX GCA TAC 858 
Thr Val Leu He Tyr Phe Leu Pro Leu Leu Val He Gly Tyr Ala Tyr 
205 210 215 

ACC GTA GTG GGA ATC ACA CTA TGG GCC AGT GAG ATC CCC GGG GAC TCC 906 
Thr Val Val Gly He Thr Leu Trp Ala Ser Glu He Pro Gly Asp Ser 
220 225 230 

TCT GAC CGC TAC CAC GAG CAA GTC TCT GCC AAG CGC AAG GTG GTC AAA 954 

Ser Asp Arg Tyr His Glu Gin Val Ser Ala Lys Arg Lys Val Val Lys 

235 240 245 

* » 

ATG ATG ATT GTC GTG GTG TGC ACC TTC GCC ATC TGC TGG CTG CCC TTC 1002 
Met Met He Val Val Val Cys Thr Phe Ala He Cys Trp Leu Pro Phe 
250 255 260 
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CAC ATC TTC TTC CTC CTG CCC TAC ATC AAC CCA GAT CTC TAC CTG AAG 1050 
His Jle Phe Phe Leu Leu Pro Tyr He Asn Pro Asp Leu Tyr Leu Lye 
265 270 275 280 

AAG TTT ATC CAG CAG GTC TAC CTG GCC ATC ATG TGG CTG GCC ATG AGC 1098 
Lys Phe He Gin Gin Val Tyr Leu Ala He Met Trp Leu Ala Met Ser 
285 290 295 

TOO ACC ATG TAC AAC CCC ATC ATC TAC TGC TGC CTC AAT GAC AGG TTC 1146 
Ser Thr Met Tyr Asn Pro He He Tyr Cys Cys Leu Asn Asp Arg Phe 
300 305 310 

CGT CTG GGC TTC AAG CAT GCC TTC OGG TGC TGC CCC TTC ATC AGC GCC 1194 
Arg Leu Gly Phe Lys His Ala Phe Arg Cys Cys Pro Phe He Ser Ala 
315 320 325 

GGC GAC TAT GAG GGG CTG 6AA ATG AAA TCC ACC CGG TAT CTC CAG ACC 1242 
Gly Asp Tyr Glu Gly Leu Glu Met Lys Ser Thr Arg Tyr Leu Gin Thr 
330 335 340 
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CAG 6GC AGX 6T6 TAC AAA <^TC AGO C6C CT6 GAG ACC ACC ATC TCC ACA 1290 
Gin Gly Ser Val Tyr Lys Val Ser Arg Leu Glu Thr Thr He Ser Thr 
345 350 355 360 

GTG GTG GGG GCC CAC GAG GAG GAG CCA GAG GAC GGC CCC AAG GCC ACA 1338 
Val Val Gly Ala His Glu Glu Glu Pro Glu Asp Gly Pro Lye Ala Thr 
365 370 375 

CCC TCG TCC GTG GAC CT& ACC TCC AAC T6C TCT TCA CGA AGT GAC TCC 1386 
Pro Ser Ser Leu Asp Leu Thr Ser Asn Cys Ser Ser Arg Ser Asp Ser 
380 385 390 

AAG ACC ATG ACA GAG AGC TTC AGC TTC TCC TCC AAT GTG CTC TCC 1431 
Lys Thr Ket Thr Glu Ser Phe Ser Phe Ser Ser Asn Val Leu Ser 
395 400 405 

TAGGCCACAG G6CCTTTGGC AGGTGCAGCC CCCACTGCCT TTGACCTGCCTCCCTTCATG 1491 

CATGGAAATT CCCXTCATCT 6GAACCATCA GAAACACCCT CACACTGGGA CTTGCAAAAA 1551 

GGGTCA6TAT GGGTTAGGGA AAACATTCCA TCCTTGA6TC AAAA2UITCTC AATTCTTCCC 1611 

TATCTTTGCC ACCCTCATGC TGTGTGACTC AAACCAAATC ACTGAACTTT GCTGAGCCTG 1671 

TAAAATAAA& GGTC6GACCA GCTTTTCCTC AA6AGCCCAA TGCATTCCAT TTCT6GAAGT 1731 

GACTTTGGCT GCATGC6AGT GCTCATTTCA GGAT6 1766 



(2) INFOBUATXOlf FOR SEQ ID NOs3: 

(i) SEQUENCE CH&IUICTERISTICS : 

(A) LENGTH; 407 amino acids 

(B) TYPES amino acid 
(D) TOPOLOOy: linear 

(ii) HOLECOLE TYPE: protein 
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(Xi) SEQUENCE DBSCEIPTION: SEQ ID N0i3: 

Met Asp Asn Val Leu Pro Val Asp Ser Asp Leu Ser Pro Asn lie Ser 
15 10 15 

Thr Asn Thr Ser Glu Pro Asn 61n Phe Val Gin Pro Ala Trp Gin He 
20 25 30 

Val Leu Trp Ala Ala Ala Tyr Xhr Val He Val Val Thr Ser Val Val 
35 40 45 

Gly Asn Val Val Val Met Trp He He Leu Ala His Lys Arg Met Arg 
50 55 60 

Thr Val Thr Asn Tyr Phe Leu Val Asn Leu Ala Phe Ala Glu Ala Ser 
€5 70 75 80 

Met Ala Ala Phe Asn Thr Val Val Asn Phe Thr Tyr Ala Val His Asn 
85 90 95 

Glu Trp Tyr Tyr Gly Leu Phe Tyr Cys Lys Phe His Asn Phe Phe Pro 
100 105 110 

He Ala Ala Val Phe Ala Ser He Tyr Ser Met Thr Ala Val Ala Phe 
H5 120 125 

Asp Arg Tyr Met Ala He He His Pro Leu Gin Pro Arg Leu Ser Ala 
130 135 140 

Thr Ala Thr Lys Val Val He Cys Val He Trp Val Leu Ala Leu Leu 
"5 150 155 160 

Leu Ala Phe Pro Gin Gly Tyr Tyr Ser Thr Thr Glu Thr Met Pro Ser 
165 170 175 

Arg Val Val Cys Met He Glu Trp Pro Glu His Pro Asn Lys He Tyr 
180 185 190 
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Glu Lys Val Tyr His lie Cya Val Thr Val Leu lie Tyr Fhe Leu Pro 
195 200 205 

Leu Leu Val lie Gly Tyr Ala Tyr Thr Val Val Gly lie Thr Leu Trp 
210 215 220 

Ala Ser Glu Xle Pro Gly Asp Ser Ser Asp Arg Tyr His Glu Gin Val 
225 230 235 240 

Ser Ala Lys Arg Lys Val Val Lys Met Met He Val Val Val Cys Thr 
245 250 255 

Phe Ala He Cys Trp Leu Pro Phe His lie Phe Phe Leu Leu Pro Tyr 
260 265 270 

Xle Asn Pro Asp Leu Tyr Leu Lys Lys Phe He Gin Gin Val Tyr Leu 
275 280 285 

Ala lie Met Trp Leu Ala Mfet Ser Ser Thr Met Tyr Asn Pro He He 
290 295 300 

Tyr Cys Cys Leu Asn Asp Arg Phe Arg Leu Gly Phe Lys His Ala Phe 
305 310 315 320 

Arg Cys Cys Pro Phe He Ser Ala Gly Asp Tyr Glu Gly Leu Glu Met 
325 330 335 

Lys Ser Thr Arg Tyr Leu Gin Thr Gin Gly Ser Val Tyr Lys Val Ser 
340 345 350 

Arg Leu Glu Thr Thr He Ser Thr Val Val Gly Ala His Glu Glu Glu 
355 360 365 



Pro Glu Asp Gly Pro Lys Ala Thr Pro ser Ser Leu Asp Leu Thr Ser 
370 375 380 
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Asn Cys Ser Ser Arg Ser Asp Ser Lys Thr Met Thr Glu Ser Phe Ser 
385 390 395 400 

Phe Ser Ser Asn Val Leu Ser 
405 

(2) ZNFOHMATION FOR SBQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /labels amide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Tyr Arg Pro Lys Pro Gin Gin Phe Phe Gly Leu Met 
1 5 ' 10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE OSSCRIPTIOK: SEQ ID KO:5: 
CC21CCAT66A TAAC6TCCTC C0GGX6 
(2) INF0RH21TI0K FOR SEQ ID N0x6: 

(i) SEQOEKCS CHARACTERISTICS: 

(A) LENGTH? 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) HOLECDLE TYPE: cDNA 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESOIIPTION: SEQ ID NO: 6: 
CTAGGA6AGC ACATT66AGG A6AA 
(2) INFORMATION FOR SEQ ID NOt7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 407 amino acids 

(B) TYPE: axnino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) UmECULB TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 7: 

Met Asp Asn Val Leu Pro Met Asp Ser Asp Leu Phe Pro Asn He Ser 
15 10 15 

Thr Asn Thr Ser Glu Ser Asn Gin Phe Val Gin Pro Thr Trp Gin He 
20 25 30 

Val Leu Trp Ala Ala Ala Tyr Thr Val He Val Val Thr Ser Val Val 
35 40 45 

Gly Asn Val Val Val He Trp He He Leu Ala His Lys Arg Met Arg 
50 55 60 

Thr Val Thr Asn Tyr Phe Leu Val Asn Leu Ala Phe Ala Glu Ala Cys 
65 70 75 80 

Met Ala Ala Phe Asn Thr Val Val Asn Phe Thr Tyr Ala Val His Asn 
85 90 95 

Val Trp Tyr Tyr Gly Leu Phe Tyr Cys Lys Phe His Asn Phe Phe Pro 
100 105 110 

He Ala Ala Leu Phe Ala Ser He Tyr Ser Met Thr Ala Val Ala Phe 
115 120 125 

Asp Arg Tyr Met Ala He He His Pro Leu Gin Pro Arg Leu Ser Ala 
130 135 140 

Thr Ala Thr Lys Val Val He Phe Val He Trp Val Leu Ala Leu Leu 
145 150 155 160 

Leu Ala Phe Pro Gin Gly Tyr Tyr Ser Thr Thr Glu Thr Met Pro Ser 
165 170 175 



Arg Val Val Cys Met He Glu Trp Pro Glu His Pro Asn Arg Thr Tyr 
180 185 190 
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Glu Lys Ala Tyr His lie Cys Val Thr Val Leu He Tyr Phe Leu Pro 
195 200 205 

Leu Leu Val lie Gly Tyr Ala Tyr Thr Val Val Gly He Thr Leu Trp 
210 215 220 

Ala Ser Glu He Pro Gly Asp Ser Ser Asp Arg Tyr His Glu Gin Val 
225 230 235 240 

Ser Ala Lys Arg Lys Val Val Lys Met Met He Val Val Val Cys Thr 
245 250 255 

Phe Ala He Cys Trp Leu Pro Phe Bis Val Phe Phe Leu Leu Pro Tyr 
260 265 270 

He Asn Pro Asp Leu Tyr Leu Lys Lys Phe He Gin Gin Val Tyr Leu 
275 280 285 

Ala Ser Met Trp Leu Ala Met Ser Ser Thr Met Tyr Asn Pro He He 
290 295 300 

Tyr Cys ^s Leu Asn Asp Arg Phe Arg Leu Gly Phe Lys His Ala Phe 
305 310 315 320 

Arg Cys Cys Pro Phe He Ser Ala Gly Asp Tyr Glu Gly Leu Glu Met 
325 330 335 

Lys Ser Thr Arg Tyr Leu Gin Thr Gin Ser Ser Val Tyr Lys Val Ser 
340 345 350 

Arg Leu Glu Thr Thr He Ser Thr Val Val Gly Ala His Glu Glu Glu 
355 360 365 



Pro Glu Glu Gly Pro Lys Ala Thr Pro Ser Ser Leu Asp Leu Thr Ser 
370 375 380 
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Aan Gly Ser ser Arg Ser Asn Ser Lya Thr Met Thr Glu Sec Ser Ser 
385 390 39S 400 



Fhe Tyr Ser Asn Met Leu Ala 
405 
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WHAT IS CLAIMED IS: 

1. A recombinant DNA sequence comprising a 
sequence encoding human substance P receptor protein 

5 having the amino acid sequence shown in PIG, 1. 

2. Human substance P receptor cDHA having the 
nucleotide sequence shown in PIG. 1. 

10 3. A process which comprises expressing a cDNA 

encoding human substance P receptor having the amino 
acid sequence shown in PIG. 1 in a bacterial or 
mammalian cell culture transformed with a DNA expression 
vector containing said gene operably linked to 

15 transcription and translation sequences in said vector 
and recovering said human substance P receptor. 

4. The process of Claim 3 in which the DNA 
expression vector is plasmid pl^hSPR. 

20 

5. CHO cells transformed with a DNA e:^ression 
vector containing a cDNA encoding human substance P 
receptor having the amino acid sequence shown in FIG. l 
operably linked to transcription and translation 

25 sequences in said vector. 

6. The cells of Claim 5 in which the DNA 
expression vector is plasmid pI^hSPR. 

30 7. Plasmid pM^hSPR. 

8. Cell line CHO-pM^hSPR #10 (ATCC CBL 10824). 
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